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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
APPLICATION FOR PATENT 

SYSTEM AND METHODS FOR CLASSIFYING 
ANOMALIES OF SAMPLE SURFACES 
Inventors: Wayne Chen, Andrew Zeng and Mustafa Akbulut 

BACKCtROT JND of trr invftstttam 

This invention relates in general to systems for analyzing anomalies of sample 
surfaces, such as those of semiconductors and those of magnetic or optical disks or 
flat panel displays, and in particular, to a system for detecting and classifying 
N anomalies of such surfaces. 

Achieving the best possible financial performance drives the continuous 
shnnkage of the critical dimensions of integrated circuit devices that are fabricated 
on surfaces of semiconductor wafers. This shrinkage requires a flatter wafer surface 
due to hmited lithographic depth of focus (DOF) and higher circuit packing density 
along with many other factors. Chemical mechanical planarization or polishing 
(CMP) has become an enabling technology to fulfill these requirements for the 
semiconductor industry. Since critical defects scale with the design rule defect 
detects tools are required to have better sensitivity; the required sensitivity is 
roughly half the size of the critical dimension. Unlike many other semiconductor 
processes, CMP is unique in its requirement that the slurry be both chemically active 
and mechanically abrasive during polishing. The combination of the chemical 
reaction and abrasive behavior of the CMP process creates a unique set of defects 
such as microscratches, chatter marks, slurry residue, etc. The adoption of CMP 
presents a great opportunity to have a better DOF budget, but poses a unique 
challenge in the inspection of processed CMP wafers, along with achieving higher 
chip yield by using appropriate yield management techniques. 

Since CMP has become an essential practice in almost all chip manufacturing 
with a design rule of 0.25pm or smaller, there is an urgent need to have a set of 
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metirobgy tools to, can no. only detect bu, also classify those defects. In general 
different types of defects have different sources and different impacts on the final' 
device yield. Classifying torn in rea. toe will signjficantiy reduce ^ 
results. The shrinkage of the ditnensions of devices leads to an increase of the 
density of chips to, can be produced on a wafer; .herefore, more value has been 
produced from each wafer. This p,aces ever-greater demands on yield management 
and ,n particular on defect inspection and classification during to CMP process the 
loss of a single production wafer leads to a significant revenue loss Driven by 
havmg shorter time for achieving and maintaining high-yield for high-value added 
produce, to ideal defect inspection system should be able ,o deliver to necessary 
■nfortnanon to have a more comprehensive solution, which includes detecting all 
types of defects, classifying ,hem, analyzing them, and recommending corrective 
actions 



Unhke particles, some of the CMP-induced defects, such as microscratches 
and chatter marks, cannot be removed by post-CMP. cleaning and it is important to 
sort them out and minimize their occurrence since they may impact yield For the 
cleanable defects such as particles, which may have no significant impact on the final 
yield, the classified defect counts will be used for process control. 

Due to the nature of the process, CMP microscratches are very difficult if 
not impossible, to avoid. The larger particles in the slurry or fall-on particles from 
the pad conditioner mainly induce this type of defect. Depending upon their 
dimensions and locations, these defects may adversely affect the yield of the device 
Needless to say, detecting and classifying these defects wil. be essential for the 
des,red process monitoring and control for achieving the best possible yield of a 
*5 production line. 

Typical CMP-induced defects can be divided into two categories: extrinsic 
defects, which are ,he result of to presence of foreign materials, and intrinsic 
defects, which are to imperfections created on to polished material.. Extrinsic 
defects include slurry residue, surface particles, and embedded particles Intrinstc 
defects include such defects as microscratches, chatter marks, wa.er marks long 
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scratches (continuous and spiral), pits, rip-ou, and dishing. Due to the comptexitv 
of the patterned wafers, additional types of defect are present. These defects are 
related to the manufacturing process. 

As the name implies, residual slurry results from incomplete or improper 
■ cleaning after the CMP step. Surface particles could be picked up from anywhere 
and are not necessarily CMP related. Embedded particles may result from existing 
surface particles, abraded film particles, flakes or particles from the slurry that are 
ground m by the down force of the polishing head. 

Therelativemotionofthewaferandthepadproducesamicroscratch when 
larger particles are present in the slurry. In particular, slurry particles larger than , 
|un m s,ze can be significant contributors to microscratch generation. Other factors 
that contribute to microscratch generation include unsuitable rinsing and buffing 
steps following the CMP step. Although buffing with soft pads over hard pads 
reduces the formation of microscratches, incorrect buffing produces worse results 
The dilute-HF dip step, which follows the buffing polish step, can increase the " 
number of micro-scratches, enlarges existing microscratches, and leads to failure of 
the device. 

The Surfscan SP1™® wafer inspection system from 
Corporation, the assignee of the present application, has been used for inspecting 
unpatterned semiconductor wafers. This system has very high sensitivity and can 
classify large scratches, such as scratches longer than 1500 nm. However many 
scratches produced during the CMP processors are smaller than that and may even 
be smaller than the spot size of the laser beam used in the system. Scratches that are 
shorter than a predetermined length such as 1500 nm are referred to herein as 
Woscratches. « Most of the microscratches may be smaller than the spot size of 
the laser beam used in the Surfscan SPI™® system, so that they appear as light 
pent defects (LPDs). Therefore, without using a classification method the user 
would not be able to distinguish between microscratches from particles or other 
types of point anomalies such as pits on the wafer surface. 
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m order to distinguish such microseratches from other point defects such as 
particles and pits, tools such as scanning electron microscopes or atomic force 
nucroscopes havebeen used. When there are a large number of defects on the wafer 
surface, such method is time consuming and not practical for use in a production 
environment. Itis, therefore, desirable to provide an improved system for classifying 
anomalies of a surface in which the above-described difficulties are not present. 

SUMMAR V QF THE TMVENTTON 

This invention is based on the observation that, by varying the sensitivity by 
which the anomalies are detected to provide output(s) or by varying a threshold 
when data on the anomalies are analyzed, more information useful for classifying the 
anomalies becomes available. By using outputs obtained at two or more different 
detection sensitivities, or by processing the data on the anomalies using two or more 
different thresholds, it is possible to distinguish between microseratches and 
particles, pits or other point defects. At least one classification of the anomalies may 
then be provided. Preferably, this can be done without requiring a user to scan the 
sample surface more than one time. This will permit a user to distinguish between 
microseratches from particles and other point defects and adjust the CMP or 
cleaning process in real time in an on-line production process or post-processing as 
20 required to improve yield. 

BRIE F DESCRIPTION nv T HE PR A Wmra 

Fig. 1 is a schematic view of a sample inspection system employing a 
substantially normal channel and an oblique channel useful for illustrating the 
25 invention. 

Fig. 2 shows partially in perspective and partially in block diagram form a 
system for inspecting anomalies of a patterned semiconductor wafer surface useful 
for illustrating the invention. 

Fig. 3 is a cross-sectional view of an imaging type defect detection system 
useful for illustrating the invention. 
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Fig. 4 is a schematic view of , simulated distribution of anomalies from a 
deteetton^temsuchasthoseshowninFigs. ,-3 and .0, where size of the anomaly 
is indicated in microns. 

Fig 5 is a schemata view of the simulated distribution of Fig 4 where the 
anomal.es have been grouped or clustered using an algorithm of this invention 

F.g. 6 ,s a schematic view of the simulated distribution of anomalies on the 
same samp.e surface as that illustrated in Figs. 4 and 5, but using a different 
detection sensitivity or threshold for analysis to illustrate an embodiment of the 
invention. 

Figs. 7A and 7B are graphical illustrations of the intensity of radiation 
detected from elongated anomalies such as scratches to illustrate an embodiment of 
«he mvention, where the cross-section of the scratches in Figs. 7A. 7B are different 
F.gs. 8A and 8 B arc graphical illustrations of the intensity of radiation 
detected from the same elongated anomalies as those illustrated in Figs 7A, 7B bu, 
at a d,fferen, detection sensitivity or threshold for analysis from that of Figs 7 A, 78 
Ftg. 9 is a schematic view of the scattering cross-section of an non-elongated 
anomaly, such as a particle. 

Fig. ,0 is a schematic diagram of a sample inspection and processing system 

20 sampll ,ra " nS i " t0 " na ' i0 '' *™ ** inSPeC,i0 " "* ° f a 

For simplicity in description, identical components are identified by the same 
numerals in this application. 
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DETAILED PFSPR gTipN OF THF. EMBODBVENT 

^°^ove,theSurfs^^^^ 
of the present application, KLA-Tencor Corporation, may be used for inspecting 
semiconductor wafer surfaces, such as those of unpatterned wafers, as well as other 

^play type devices. A description of such system is set forth in US Patent 
A PP .icationSenalNo.08/933,771,fd e donSeptemberl9, 19 97,enti,ed "Improved 
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Sat^pectton System." TWa ,p plica «„„ is incorporated herein by reference h 
enttrety, and is referred t0 herein as the 

fecnption be.o„ of the Surfscan SPI^ instmraent take „ froffl ^ 
descnbe-s a„o m a. y detection of semiconductor wafer and flbn surfaces, it will be 
understood that essentiaHy the same indent and process may be used for defect 
detectron of other samples such as noetic or optica , disks used for 
storage and flat panel display type devices. 

Inspecting a sample surface with oblique and/or norraa, Hlutnination beams 
can be implemented in a number of ways. Fig. , is a schematic view of a sample 
mspechon system 50 to illustrate a genera, se, up for dementing anomaly 
deteconusmg bom no™, and obuque illumination beams. A rad,a,ion source that 
provtdes radiation at one or more waveiengths in a wide electromagnetic specttum 
(mdudmg but no, limited to ultraviolet, visib,., infrared, may be used, such as a laser 
52 provtdtng a laser beam 54. An electron beam or the like can be used as the 
radtatton source as weU. A ,ens 56 focuses the beam 54 through a spatial Alter 58 
and lens 60 collimates the beam and convey, i, to . polari2ing „ er 62 

Beamsphtter 62 passes a firs, polarized component to the norma, illumination 
channel and a second po.arized component to the oblique illumination channel 
where the f,rs, and second components are orthogonal. In the norma, illumination 
channel 70, the firs, po.arized component is focused by optics 72 and reflected by 
rmrror 74 towards a sample surface 76a of a semiconductor wafer 76. The radiation 
scattered by surface 76a is collected and focused by an ellipsoidal mirror 78 ,o a 
photomultiplier tube 80. 

In the oblique ulutnination channel 90, the second polarized component is 
reflected by beamsplitter 62 to a mirror 82 which reflects such beam through a half- 
wave Plate 84 and footed by optics 86 to surface 76a. Radiation originating from 
theob tqueillumination beamin the oblique channel 90 and scattered by surface 76a 
- ^collected by an ellipsoidal mirror and focused ,o photomultiplier tube 80 
Photomultiplier tube 80 has a mask or pinhole entrance 80a. The mash or pinhole 
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80a and the muminated spot (fr om , he nomia| and ^ ^ ^ 
surface 76a) are preferably a. the foci of the ellipsoidal mirror 78 

Wafer 76 is rotated by a motor 92 which is also moved linearly by transducer 
94, and both movements are controlled by a controller 96, so that the normal and 
obltoueiUummation beams in channel 70 and 90 scan surface 76a along a spiral scan 
to cover the entire surface. 

As described in the companion application, scattered radiation from the 
wafer surface originating from the normal illumination path and from the oblique 
■nomination path may be distinguished in a number of different ways, such as by 
usmg different poUrization or wavelength of light in the two different channels 

For patterned semiconductor wafers, the AIT II tool is a laser scanning 
-mspection system from the assignee of the present application, KLA-Tencor 
Corporation, which may be used when inspecting such surfaces. A description of 
such system is set forth in U.S. Patent No. 5,864,394, which is incorporated herein 
m ,,s entirety by reference. While the description below of the AIT II instrument 
taken from such patent describes anomaly detection of semiconductor wafer 
surfaces, it win be understood that essentially the same instrument and process may 
be used for defect detection of other sara p,e s such as magneticor optica, disks used 
for mformarion storage and flat pane! display type devices, or photomask or ceramic 
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Another surface inspection system of this appl.cation will now be described 
tn reference to Fig. 2. As shown in Fig. 2, system ,20 includes a laser 122 providing 
a laser beam 124. Beam ,24 is expanded by beam expander ,26 and the expanded 
beam 128 ,s deflected by an acous.o-optic deflector (AOD) ,30 into * defected 
beam ,32. The deflected beam 132 is passed through post-AOD and polarization 
selection optics 134 and the resulting beam is focused by telecen.nc scan lens 136 
a* a focused beam ,38 onto a spot 1,0 on surface ,40 to be inspected, such as tha. 
of a semiconductor wafer, photomask or ceramic tile, patterned or unpatterned 

In order to move the illuminated area tha, is focused onto surface 140 for 
scanning the entire surface, The AOD 1 30 causes the deflected beam l32,ocha„ge 
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in direction, thereby causing the illuminated spot 1 10 on surface 140 to be scanned 
along a sweep 150. As shown in Fig. 2, sweep 150 is preferably a straight line 
having a length which is smaller than the dimension of surface 140 along the same 
direction as the sweep. Even where sweep 150 is curved, its span is less than the 
dimension of surface 140 along the same general direction. While the illuminated 
spot is scanning surface 140 along sweep 150, surface 140 of the wafer is moved by 
an XY stage (not shown) along the X axis so that the illuminated area of the surface 
moves along arrow 152 and AOD 130 causes the illuminated spot to scan a sweep 
152 parallel to sweep 1 50 and in an adjacent position spaced apart from sweep 1 50 
along the negative X axis. As the illuminated spot covers said sweep, surface 140 
is moved by a small distance so that the area of the surface to be illuminated is 
moved along direction 152 in order to scan an adjacent sweep at a different X 
position. As described in U.S. Patent 5,864,394, this small distance is preferably 
equal to about one quarter of the dimension of spot 1 10 in the X direction. This 
process is repeated until the illuminated spot has covered strip 1 54;- at this point in 
time the illuminated area is at or close to the edge 154a. At such point, the surface 
140 is moved by the XY stage along the Y direction by about the length of sweep 
150 in order to scan and cover an adjacent strip 156, beginning at a position at or 
close to edge 156a. The surface in strip 156 is then covered by short sweeps such 
as 150 in a similar manner until the other end or edge 156b of strip 156 is reached 
at which point surface 150 is again moved along the Y direction for scanning strip 
158. This process is repeated prior to the scanning of strip 154, 156, 158 and 
continues after the scanning of such strips until the entire surface 140 is scanned. 
Surface 140 is therefore scanned by scanning a plurality of arrays of sweeps the 
totality of which substantially covers the entire surface 140. 

The deflection ofbeam 132 by AOD 130 is controlled by chirp generator 180 
which generates a chirp signal. The chirp signal is amplified by amplifier 182 and 
applied to the transducer portion of AOD 130 for generating sound waves to cause 
deflection ofbeam 132 in a manner known to those skilled in the art. For a detailed ' 
description of the operation of the AOD, see "Acoustooptic Scanners and 
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Motors." by Milton Gottlieb in QslkaL&aj ^ ed fcy ^ R 
Dekker .99,, pp. 6 15 -o 85 . Briefly, the sound waves generated by the transduced 
portion of AOD 130 modulate the optical refractive index of an acoustooptic crystal 
tn a penodic fashion thereby leading.o deflection of bean, 132. Chirp generator 180 
■ generates appropriate signals so ,ha, after being focused by lens ,36, the deflection 
of beam 132 causes the focused beam to scan along a sweep such as sweep 150 in 
the manner described. 

Chirp generator 180 is controlled by timing electronic circuit 184 which in 
the preferred embodiment includes a microprocessor. The microprocessor suppiies 
the beginning and end frequencies fl, fi to the chirp generator 180 for generating 
appropnate chirp signals to cause the deflection of beam 132 within a predetermined 
range of deflection angles determined by the frequencies fl, fl. The auto-position 
sensor (APS) optics 190 and APS electronics 192 are used to detect the level or 
height of surface 140. Detectors such as detectors 1 10a, 1 10b, 1 1 la .1 lb of Fig 
2 collect light scattered by anomalies as well as the surface and other structures 
thereon along sweep 150 and provide output signals to a processor (not shown 
which maybe a set of processors, one for each detector) in order to detect anomalies 
and analyze their characteristics. 

The systems in Figs. 1 and 2 described above are what is known as dark field 
systems, where the radiation scattered by the anomalies and collected for detection 
is m directions away from the specular reflection direction of the illuminating beam 
Anomaly information may also be obtained by a different type of system known as 
imaging detection systems, such as that shown in Fig. 3. 

As shown in Fig. 3, the input laser beam 202 of the imaging detection system 
200 is focused by a lens 204, reflected by mirror 206 and then collimated by lens 208 
to sample surface 210, which may include a repetitive pattern, or be an unpatterned 
surface: Different from thedark field systems of Figs. 1 and 2, the area of surface 
212 illuminated by the collimated beam 210 is typically much larger than that of the 
darkfield systems ofFigs. land 2. The illumination scheme in the imaging system 
of F,g. 3 is known in the industry as flood iilumination. Thus, the area 212a of 
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sa«plesu I face212iIIuminatedbybeam210 m aycontainmanydefects Lenses208 
and 216 focus light scattered by a defect such as defect 220 to a corresponding spot 
of an unage plane 222 of lenses 208, 216, f orming an image 220a ^ ^ ^ 
m the .mage plane 222. Typically, an array of charged coupled devices (CCDs) is 
5 placed « the image plane so that the defect image 220a falls on a corresponding 
CCD in the array. 

As noted above, area 212a is large and may contain more than one defect 
such as defect 230. Light scattered by defect 230 in area 212a is also focused by 
lenses 208 and 216 onto the image plane but at a different location than defect image 
220a, such as defect image 230a, shown in Fig. 3. Since the detector in the CCD 
array detecting the scattered light forming the defect image 230a is different from 
the detector detecting the defect image 220a, the two defects 220 and 230 can be 
dnTerentiated and the resolution of the system depends on the collection optics and 
the spatial resolution of the CCD array and not on the optics illuminating samp , e 
surface 212. This is in contrast to the dark field systems of Figs. 1 and 2 where the 
mummation beam is focused onto a small spot typically of several to tens of microns 
in cross-sectional dimensions, where the signal-to-noise ratio is improved by 
reducmg the spot size. In system 200, as noted above, the resolution is controlled 
by the collection optics (such as lenses 208, 216) and the spatial dimensions and 
speed and time of signal accumulation of the detector array. 

Beam 210 is caused to scan the surface 212 by causing relative motion 
between the sample surface 212 and the beam, so that a defect map of sample 
surface 212 is obtained in a manner known to those skilled in the art. As described 
above, the dark field systems of Figs. 1 and 2 scan the entire surface of the sample 
and provide defect maps of the sample. Fig. 4 is a schematic view of a simulated 
d.stnbutzon of anomalies from a detection system such as those shown in Figs 1 3 
and 10 (to be described below), where the size of each anomaly is indicated in 
microns. 

The detection systems such as those shown in Figs. 1-3 and 10 employ 
detectors that convert radiation scattered or reflected by anomalies towards 
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detectors that convert such radiation signal to electrical signals. The electrical 
signals are then processed by a processor which may be a digital signal processor 
microprocessor, programmable logic or other processing circuits. The processor 
will then process the signals from the detectors to analyze and classify the anomalies 
The combined inspection and processor system has an inherent noise level. Noise 
can also be caused by stray radiation collected and detected by the detector where 
the stray radiation did not originate from the anomalies. Optical imperfections 
variations in the illumination intensity and other environmental factors may also 
affect the noise level in the inspection/processor system. Therefore, there is an 
inherent noise level which sets the limits for detection and classification of anomalies. 

In one way to increase the amount of information obtained from the 
inspection/processor system, the sensitivity of the detector can be set to the highest 
value. As noted above, the detectors used in the systems of Figs. l-3and lOinclude 
photomultiplier tubes, photodiodes and CCD arrays. In all of these detectors, signal- 
to-noise ratio can be improved by increasing the time of detection. Since the noise 
signals collected by the detectors are substantially of a random nature whereas the 
signals containing information concerning the anomalies are not, by collecting the 
radiation signals for a longer period of time, the effect of the random signals will 
tend to cancel whereas the signals containing anomaly information would not 
thereby increasing the signal-to noise ratio. It is also possible to increase gain 
control to increase detection sensitivity. Thus, gain control can be increased to the 
maximum in the photomultiplier tube to provide the maximum amplitude electrical 
signal in response to the detection of radiation signals. 

mtheprocessorthatprocessestheelectricalsignalprovidedbythedetectors 
to analyze and classify the anomalies, there is a noise level of the combined detector 
and processor system, where only signals of amplitudes above this noise level may 
be processed to yield useful information concerning the anomalies. 

Another possible limitation is the processing speed of the processor of data 
concerning anomalies from the detector where the result of the analysis and 
classification should be available on a real time or substantially real time basis. Thus 
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where the invention of this application is applied to detecting and classifying 
anomahes on a semiconductor wafer, a certain amount of time is required to remove 
the wafer that has been inspected and replace it with one that has not been in order 
to perform anomaly detection in an online production process of electronic devices 
5 If the speed of the processor is fast enough to analyze and classify the anomalies on 
one wafer that has been inspected before or not long after inspection data of the next 
wafer reaches the processor, there is no need to significantly slow down the 
production process in order to accommodate the speed of the processor In such 
circumstances, the processor is able to analyze and classify the anomalies on a rea! 
10 time or substantially real time basis. 

In order to obtain the maximum information by means of the detection 
systems of Figs. 1-3 and 10, it is preferable to set the sensitivity of the detection 
system to the highest level. This may mean, for example, setting the gain of the 
photomultiplier tube to the highest level, and.collecting radiation from the anomalies 
5 and detecting the collected radiation for as long as would be practical for the user 
When applied in a production process, for example, where results of the analysis and 
classification are desirable on a real time or substantially real time basis the 
collection time may be set to a maximum value while avoiding significant delays in 
the production process. 

5 In the processor processing the output of the detector in Figs 1 -3 and 10 

a signal threshold is typically set at a level that is above the system noise level for 
analysis of the detector output. To obtain the maximum information concerning the 
anomalies from the detector output, it would be desirable to set the threshold to as 
low a level as possible above the noise level. Again, when applied in an online 
production process where analysis and classification of the anomalies is desired on 
a real tune or substantially real time basis, such threshold is preferably set as low as 
possible without delaying the production process. Where the detector output is 
stored and processed on a non-real time basis, the threshold may be set at a level 
which is slightly above the noise level, for example. 
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In existing anomaly inspection systems, the sensitivity of the detector and/or 
the threshold of the processor may no. have been set to provide the maximum 
.formation, since such information may overwhelm theuser. For examp.e, if certain 
small size defects do no, significant affect the yield of certain electronic devices 
> febncateo from the wafer, the user may not wish to be aware of such anomalies 
Furthermore, where anomaly information is provided in the form of a wafer map for 
example, the presence of information on a large number of immaterial small siae 
anomalies may render I, difficult for the user to discover and focus on the larger 
srzed anomalies which do affect yield. For this reason, in conventional anomaly 
detection systems, the sensitivity of the detector and/cr the ,hresho.d of the 
processor may no. have been se, to provide ,he maximum information, since such 
tnfonnation may conceal the really significant anomalies tha, are of concern amongst 
thousands of immaterial small anomalies. 

M noted above, the conventional method for defect analysis and 
caseation is by means of optical microscopes, confocai review stations, scanning 
electron microscopes and atomic force microscopes which are typically very slow 
When the large anomalies of concern are present on the wafer map amongst 
thousands of small anomalies not of concern, it will be difficult to apply suc h 
conventional techniques to each of the anomalies. On the other hand, where the 
sensitivity of the detector and the threshold of the processor are not set to provide 
the maximum information, it may be difficult for the user to distinguish between a 
pomt defect such as a particle from a microscratch. This invention is based on the 
observation that, by analyzing information on the anomalies at different detector 
sensitmties and/or processor thresholds, it is possible to distinguish point defects 
such as particles from microscratches without inundating the user with extraneous 
and useless information. 

Preferably, the sensitivity of the detector is se, ,o ,he highest level and the 
processor threshold is se, to the .owes, possible level to provide ,he maximum 
■"formation on the anomalies. The processor then performs an analysis of the 
de.ec.or output to provide a firs, classification of the anomalies. Then thesensitivity 
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of the detector and/or the threshold of the processor is altered and the detector 

cutput.sagainanalyzedandtheanomaliesclassifiedmasecondclassificat^ If the 
first classification is obtained at the highest sensitivity and the lowest threshold the 
second classification would then be obtained at a lower sensitivity and/or higher 
> threshold to screen out extraneous information useless to the user. What appears 
as pent anomalies in the second classification may turn out to be scratches in the 
first classification. Therefore, while the second classification may be used as the 
bas,s of a presentation of sample defects to the user, such as in the form of a wafer 
map, the second classification may be supplemented or altered using the first 
class.ficationtoprovidemoreaccurateinformationtotheuser.Forexam if what 
appears to be point anomalies in the second classification are actually scratches in 
the first classification, then a wafer map may be constructed using the second 
class-fication but where the point anomalies in the second classification are altered 
to m dl cate microscratches. Such modified and improved wafer map may then be 
prodded to the user. This process is illustrated in reference to Figs, 4-6 below 

Fig 4 is a schematic view of a simulated distribution of defects after a scan 
usmg any one of the systems of Figs. 1-3 and 10. Obviously the number of defects 
or anomalies in actual wafer maps may be much larger than shown in Fig 4 The 
s,ze of the defects are shown in microns in Fig. 4. A clustering algorithm is then 
performed by the processor for grouping the defects together. The algorithms 
performed by the processor may include Search Distance, Minimum Events Aspect 
Rat,o (AR) and Minimum Le ngth . If there h a ^ on ^ ^ ^ 

Afferent portions of the scratch along the length of the scratch may appear as a 
separate defect in the defect map. In order to identify a scratch, what may appear 
to be .solated defects or anomalies located in proximity to each other may need to 
be grouped together first so that a scratch may be identified. This is accomplished 
by the Search Distance algorithm. In this process, for each defect on the wafer map 
the processor searches for one or more defects located within a certain radius from 
such defect. When such other defect or defects are found, this process is repeated 
for such defect or defects found and the process continues until no further defect is 
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found within the radius of the last defect. The defects so found then are examined 
astowhethertheyaretobegroupedtogether. In reference to group C of Fig 5 for 
example, the defect of size 0. 1 i m is fo und in th , Search Distance ^ ^ 

wtthm a certain set distance from the defect of size 0.09pm, using the defect of size 
5 0.09pm as center in the algorithm. Then using as the center the defect of size 
0. 1 lp m , a third defect of size 0.1p m is fo und to 5e within saidradius TWs process 
is repeated until the processor finds the fifth and last defect of size also of 0 lp m 
after which no further defect is found except for the previous.y mentioned defects ' 
Addmonal algorithms are then applied to these five defects to see whether they are 
10 to be grouped together. 

The combination of the chemical reaction and abrasive behavior of the CMP 
process may cause a large drop of abrasive solvent to regain on the sample surface 
causmg a shaUow depression on the sample surface that appears in the defect map 
as a number of defects spread over an area. Or certain droplets of abrasive solvent 
may land in the vicinity of each other on the wafer surface causing several small 
defects to be located in the vicinity of each other. It would be useful to be able to 
d,stmguish between such defects. Thus, the defects that indicate a shallow 
depression represent an area anomaly and are preferably grouped together as an area 
defect. The isolated defects caused by the droplets are individual defects not 
3 necessarily correlated with one another, and are preferably not grouped together but 
remain as individual defects. Then there are the defects that appear as a string that 
stem from a scratch and are preferably classified as a scratch. 

Individual defects can be distinguished from scratches and area anomalies by 
requ,n„ g that a certain minimum number of defects be required to form a scratch or 
area defect. Therefore, in the above-described process of the Search Distance 
algorithm, if the number of events, defects or anomalies found in the above- 
described process is less than a certain preset minimum, then the defects may not all 
ongmate from a single scratch or area defect, and will not be grouped together 
After the Search Distance algorithm has been performed, if the number of such 
events, defects or anomalies found within a certain distance from at least one other 



10 



15 



>0 



o ° . • o o 



IS 



of such events, defects or anomalies in the same process of .he Search Distance 
algonthn, exceeds a preset value in the Minimum Event, algorithm, the defects are 
then grouped into groups, such as groups A, C andF in Fig. 5 Thus, if the preset 

""^-"•"eMimmumEventsaigorithmisfour.fcrexample.thefivedefectsfound 
m the above-described process are then grouped together as group C 

In order ,o distinguish area defects or anomalies from scratches the 
processor apples an Aspect Ratio algorithm. In order for a particular group of 
defects or anomalies to be identified as a scratch instead of an area anomaly the 
geographical distribution of defects are preferably such tha, such geographical 
d.stnbution is elongated and resembles that of. scratch rather than tha, of an area 
Tins may be ascertained by drawing a boundary (such as lines 280 282 284 in 
F.g.5) around the group and computing the ratio of a length , , 0 a widttl w of , he 
boundary as indicated in Fig. 5. If , he ratio is sma , |ef |ha „ , ^ ^ ^ 
then the group of defects forms an area type anomaly rather than a scratch Thus' 
for the groups or clusters A-F shown in Fig. 5, where the minimum number of 
defects required for a scratch or area is se, to a value such as 4 and aspect ratio for 
a scratch set to 3, clusters or groups A, CorF are identified as scratches, whereas 
groups B, D and E are identified as point or individual defects or anomalies No 
area defects are shown in Figs. 4-6. 

For certain semiconductor manufacturers, some of the defects or anomalies 
shown ,„ Fig. 5 may be too small to be of concern, such as the defect of size 0 , urn 
tn group E or the 5 defects of sizes up ,o and including 0.1 lpm in group A Thus 
another wafer map such as that shown in Fig. 6 may be obtained by reducing the 
sensitivity of detection of the detectors and/or raising the thresholds for signal 
processing by the processor, so tha, defects smaller than a certain size are eliminated 
as shown in Fig. 6. Thus, by decreasing the detection sensitivity and/or by raising 
«he stgna. threshold in the processor so tha, only defects of sizes 0,1 2pm and above 
are shown and executing the above-described algorithms, one obtains the anomaly 
classification in the wafer map of Fig. 6. In reference to Figs , „ ^ ^ 
cause groups A, D and E ,o be eliminated. The defects originally in group F in Fig 



o o Q o 



17 



5 , S now broken up into two different groups P and G in Fig ,6, since the linidng 
defects of sizes 0.09am and 0.1pm now no ionger appear in Fig. 6. By comparing 
the two wafer maps in Figs. 5 and 6, it is possible to characterize the anomaly in 
group C as a scratch instead of a point defect in the wafer map of Fig. 6 and i, is also 
5 possible to identify groups F and G as stemming from the same scratch ^ - 
a shorter scratch F' and a point defect G. 

In a conventional system, to avoid the problem that anomalies of sizes of 
concern are buried amongst a large number of small defects not of concern the 
sensitivity of the detector and the threshold of the processor are typically not set to 
0 prov.de the maximum information, so that the defect map provided is typically that 
showninFig. 6. As noted above, if the user is provided only with the wafer map of 
F.g. 6 without the benefit of the map of Fig. 5, the user would identify the defect in 
groups C and G as point defects and only group F' as relating to a scratch 
Furthermore, group G would appear to be unrelated to group F'. This may lead to 
5 different remedies for the wafer cleaning and CMP processes. 

For example, the wafer map of Fig. 6 may reveal far fewer scratches than 
there are in reality so that the user may attempt to solve the problem by adjusting the 
cleanmg process rather than altering the CMP process. This may be inadequate to 
correct the problem. If, however, the group C is also identified to be a scratch and 
that groups G and F' are identified as stemming from the same scratch, the problem 
of the scratches on the wafer may be more serious than what will be apparent from 
a smgle wafer map of Fig. 6. This may cause the user to adopt a different solution 
to theproblem by also altering the CMP process. For certain users, it may be useful 
to provide both wafer maps of Figs. 5 and 6 to provide more information to the user 
for monitoring or altering the cleaning and CMP processes. 

Instead of having to perform the above-described various algorithms twice 
to arrive at two different wafer maps using different detector sensitivity or 
processing thresholds, such as the wafer maps of Figs. 5 and 6, it is possible to 
simplify the process by performing the algorithms only once to arrive at the wafer 
map at Fig. 5 at the maximum sensitivity of the detectors, or using the lowest 
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Or**, in processing. As noted above, such map may contain too ^ 
-» anomaly no, of inters or concern ,o the user The size of the an " 

: f ~r; athasbeenset ^^*-««.«»LJ£ 

-Ofhede t ecto rt oac. rt ain,eve 1 ,on ly defec tS aboveace rt a i „ si2ewouWbedete J 

a „am s,ze g,v,„g rise ,o outpu. sig„a, s abov6 the (hresho|d ^ ^ « 
wafer map to be displayed or sen, ,o the user. 

Using .he above-described modified and simphfied process, i, „ possible to 

P ~;: ^ ^ " Rg ^ «" ~ * «~* C and gro 

F and O „,„ be accural Cashed in ,be same manner when wafer maps o f bom 

F.g, 5 and 6 are obfained. Wbere one is conSdem ma. ad«,ua.e anomaiy ' 

nformauon may be obfained wifhou. se„i„g .he defecfor sensi.ivijto the ^ 

•ev< and/or setting ,Ke processing rbresboid fo me iowes, ,eve, , is possib f to 

momfied process a, a de«ec,or sensitivity lass then the ^ " 

P— ^eaho.dnigHer.han.be.oweafpoasime.eve, An such variations ar 
withm ,he scope of the invention. 

F f 7A - 7B - S-phica, Mustratious of tbe intensity of radiation 
de ec.ed .om e ongated anomahes sucb as scratches to i„ua,a t e an elbodime 
*« mvenuon, whefe the cross-section of the scratches in Pi g , 7A , 7B are dm J 
As shown m Fig. 7A . the ^ 302 indica(e (he 

*o„g al engmofascra,ch, where , he cross-sectiona, dimension, 303 of fhesc^ 

g ::rr are 3,50 swn in Fig ?a - — >■ * «■ - 

gen., s , from one e „ d rf [he ^ (o o(her ^ ^ 

d«ec,d rad,a.,o„ signa, aiong ,he .engfh aiso changes s ,ow,y. As w i„ he Led J 

F,g. 7A .he deeper ,he portion of .he anomaly, .he higher wi„ be the intensity of the 
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signal detected by the detector. Also illustrated in Fig. 7A are the two thr \ u 
SHI and W? „e *u tW0 thres hoIds 

SHI and SH2 of the processor for processing. Where the signa, processina 

of the processor has been se, ,o SH, ,„ p rovide m l J^Z 

rr a8 :;?; isob,ained ^-^^-^^^ 

^ea„o,aof*e P rooe SS „r i s S eaova lu eSH 2 ,oprov id ea S ec.„ d c,ass in c, tio „ 
A company of th e two classifications ma y ,,he„ b e used ,o provide a better 
characenaation of the defects in the manner described above 

A,te ™ tiv * i ' i - l -P-ib,e t ocha„ 8 e«bede,ec l io„sen S i,ivi, yDSoflhe 
de^or.ns.eadof.be.nresbo.dofrbeprocesaor. ThisisihustratedinF^ S A Th e 
s-gria, tnte„sit y 30 2 detected b y ,be detector in Fig. 7A „ assumed to „.* , M 
^rsensitivi, yDS ,. The signa, i„ t ensi ty 30. detected by the aame de t e o 

from the same scratch shown in Fig 7 A hut 9t »in a ■ • 

rig. /A but at a lower detecti S ensitivitv DS? ic 

«be fi r St c 1 ass, fl ca t io„a„d th e S ig„a Ii „ t e nsit ies304 t oprovideasecc„dc,assi fi ca tion " 
a companson of.be two Cassations win y ie,d uaeftd information in ,„ e ^ 

22 above . 11 is possiwe ' ° f — ,o ^ b - - sot; 

d «ec or _„es DS ,o yi e,d simi,ar resuhs. Wbere otdy one parameter i 

Zi:::n time at a differem *— — — - - - 

I •!! , UCh0n e " Vir0nmen '' ^ *°" " ^ 

clustenng or class , flca „ on are preferaWe . wji) ^ und ^ (ood 

a-gonthms may be used, such as t bose described in U.S. Paten, 5,99 , ,o99 which 
■ncorporated herein by reference in i, s emirety 

— JL* 31,0 " 8raPWC " »f the of radiation 

detected from the same eiongated anomalies as those illustrated in Figs 7A 7B b„, 

2 .™ and 8B i„„ strate „, y (tK same cons|deraiions ^ « ■ J£ 
^ fr ° m 3 SCra ' Ch tha ' ^ « one end, so tha, the i„,ensh y signa, 
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increases gradually along the length of the scratch but then drops off abruptly at one 
end; 

Fig. 9 is a schematic view of the scattering cross-section and non-elongated 
anomaly, such as a particle. 

Fig. 10 is a schematic diagram of a sample inspection and processing system 
410 illustrating an information flow during the inspection and processing of a sample 
76. As shown in Fig. 10, a radiation source and supply system 412 provides 
radiation to a sample 76 and the radiation provided by system 412 and scattered or 
reflected by anomalies on sample 76 are collected and detected by collection and 
detector system 414. The detector output from system 414 is provided to a 
processor 400 for analysis and classification of the anomalies. The classifications 
provided by processor 400 are then provided to a processing device 416 for 
processing sample 76 using information from the classifications. For example if the 
classifications provided by processor 400 indicate that the anomalies are mostly 
particles that can be removed by cleaning process, then a CMP process performed 
by processing device 416 does not need to be altered for process.ng samples such 
as sample 418. Whereas if the classifications provided by processor 400 indicate a 
significant number of scratches, processing device 416 may alter its CMP process 
to reduce the number of scratches caused when processing sample 418. 

System 412 may provide any type of radiation to sample 76 from which 
radiation from anomalies may be detected by system 4 14 for analysis by processor 
400. Thus, source 412 may be an electron source so that system 412, system 414 
and processor 400 form an instrument similar to a scanning electron microscope 
System 4 12 may also provide electromagnetic radiation in a frequency or wavelength ' 
rangedifferentfromthoseofinstrumentsinFigs. 1-3. The electromagnetic radiation 
provided by system 412 may include x-ray, ultraviolet, infrared or microwave 
radiation. 

At the present state of the art, the inspection system comprising system 4 1 2 
system 414 and processor 400 typically forms an instrument separate and apart from 
the processing device 416. Given the trend of integration in the semiconductor 
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equipment industry, it may be desirable to integrate the inspection/ detection system 
together with the processing device to form an integrated system 4 1 0, including both 
the lnS pection components 412, 414, 400 and the processing component 416 in the 
same instrument 410. 

While the invention has been described above by reference to various 
embodiments, it will be understood that changes and modifications may be made 
without departing from the scope of the invention, which is to be defined only by the 
appended claims and their equivalents. All references referred to herein are 
incorporated herein by reference in their entirety. 



